Fault-Tolerant Systolic Array Design With Partially- Reconfigurable FPGAs
نویسندگان
چکیده
Partial dynamic reconfiguration, PDR, is an important feature of modern reconfigurable architectures such as the Xilinx Virtex FPGA devices. In this paper we focus on the application of PDR to fault-tolerant systolic arrays in one dimension. Single as well as multiple faults in one or more array processing elements, PEs, are considered. In our approach modular redundancy is not used and the goal is to maintain total processing latency of the array at the original non-faulty value. This avoids stalling the data input due to reconfigurations of one set of PEs to another set to work around the fault. We consider arrays of N PEs with all possible single-fault locations and formulate an integer linear programming, ILP, model and constraints for each case. The optimal solution selects the PE to reconfigure as a temporary replacement of the faulty PE, so that total processing latency is not increased when the processing clock rate is higher than the data arrival rate. We also develop a heuristic method for solving the ILP problem, which overlaps as many reconfiguration and data processing tasks as possible. The runtime of the heuristic method is very reasonable and matches the exact ILP solution in each case. Experimental results for a single-fault N PE array show that our heuristic finds optimal temporary reconfigurations so that input data does not have to stall.
منابع مشابه
Fault-Tolerant Implementation of Systolic Arrays
In this paper, several fault-tolerant hardware implementations of Systolic Arrays on reconfigurable devices, such as FPGAs, are investigated and primal selection criteria such as resource utilization, computation time, advantages and disadvantages of every fault-tolerant method are presented. Since the main goal of the designs presented herein is the high performance of the implementations, the...
متن کاملA (Fault-Tolerant)2 Scheduler for Real-Time HW Tasks
This paper describes a fault-tolerant scheduler that uses the AreaTime response Balancing algorithm (ATB) for scheduling real-time hardware tasks onto partially reconfigurable FPGAs. The architecture of the ATB scheduler incorporates fault-tolerance by design features; including Triple Modular Redundancy (TMR), parity protection of its memories and finite state machines, as well as spatial and ...
متن کاملEvolutionary Dynamic Allocation of Relocatable Modules onto Partially Damaged Xilinx FPGAs
Dynamically and partially reconfigurable FieldProgrammable Gate Arrays (FPGAs) open the doors for new more advanced fault-tolerance techniques. This technology permits indeed to adjust at runtime the architecture of the system in order to keep it fault-free at every time. We present a tailored Genetic Algorithm (GA) aimed at online searching a fault-tolerant architecture for the system in terms...
متن کاملAn Overview of Different TMR Design Techniques with Comparison to Alternative Fault- Tolerant Design Methods for FPGAs
With growing interest in the use of SRAM-based FPGAs in space and other radiation environments, there is a greater need for efficient and effective fault-tolerant design techniques specific to FPGAs. Triple-modular redundancy (TMR) is a common fault mitigation technique for FPGAs and has been successfully demonstrated by several organizations. This technique, however, requires significant hardw...
متن کاملA Domain Partition Model Approach to the Online Fault Recovery of FPGA-Based Reconfigurable Systems
Field programmable gate arrays (FPGAs) are widely used in reliability-critical systems due to their reconfiguration ability. However, with the shrinking device feature size and increasing die area, nowadays FPGAs can be deeply affected by the errors induced by electromigration and radiation. To improve the reliability of FPGA-based reconfigurable systems, a permanent fault recovery approach usi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011